Power laws for family sizes in a duplication model

نویسندگان

  • Rick Durrett
  • Jason Schweinsberg
چکیده

Qian, Luscombe, and Gerstein (2001) introduced a model of the diversification of protein folds in a genome that we may formulate as follows. Consider a multitype Yule process starting with one individual in which there are no deaths and each individual gives birth to a new individual at rate one. When a new individual is born, it has the same type as its parent with probability 1 − r and is a new type, different from all previously observed types, with probability r. We refer to individuals with the same type as families and provide an approximation to the joint distribution of family sizes when the population size reaches N . We also show that if 1 S N1−r, then the number of families of size at least S is approximately CNS−1/(1−r), while if N1−r S the distribution decays more rapidly than any power. Running head: Power laws for gene family sizes. ∗Partially supported by NSF grants from the probability program (0202935) and from a joint DMS/NIGMS initiative to support research in mathematical biology (0201037). †Supported by an NSF Postdoctoral Fellowship. AMS 2000 subject classifications. Primary 60J80; Secondary 60J85, 92D15, 92D20.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

4 Power laws for family sizes in a duplication model

Qian, Luscombe, and Gerstein (2001) introduced a model of the diversification of protein folds in a genome that we may formulate as follows. Consider a multitype Yule process starting with one individual in which there are no deaths and each individual gives birth to a new individual at rate one. When a new individual is born, it has the same type as its parent with probability 1 − r and is a n...

متن کامل

0 Ju n 20 04 Power laws for family sizes in a duplication model

Qian, Luscombe, and Gerstein (2001) introduced a model of the diversification of protein folds in a genome that we may formulate as follows. Consider a multitype Yule process starting with one individual in which there are no deaths and each individual gives birth to a new individual at rate one. When a new individual is born, it has the same type as its parent with probability 1 − r and is a n...

متن کامل

. PR ] 1 1 Fe b 20 05 Power laws for family sizes in a duplication model

Qian, Luscombe, and Gerstein (2001) introduced a model of the diversification of protein folds in a genome that we may formulate as follows. Consider a multitype Yule process starting with one individual in which there are no deaths and each individual gives birth to a new individual at rate one. When a new individual is born, it has the same type as its parent with probability 1 − r and is a n...

متن کامل

BMC Evolutionary Biology

Background. Power distributions appear in numerous biological, physical and other contexts, which appear to be fundamentally different. In biology, power laws have been claimed to describe the distributions of the connections of enzymes and metabolites in metabolic networks, the number of interactions partners of a given protein, the number of members in paralogous families, and other quantitie...

متن کامل

Evolutionary implications of a power-law distribution of protein family sizes

1 Current-day genomes bear the mark of the evolutionary processes. One of the strongest indications is the sequence homology among families of proteins that perform similar biological functions in different species. The number of proteins in a family can grow over time as genetic information is duplicated through evolution. We explore how evolution directs the size distribution of these familie...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004